A Simulation Study of the Model Evaluation Criterion MMRE

نویسندگان

  • Tron Foss
  • Erik Stensrud
  • Barbara A. Kitchenham
  • Ingunn Myrtveit
چکیده

The Mean Magnitude of Relative Error, MMRE, is probably the most widely used evaluation criterion for assessing the performance of competing software prediction models. It seems obvious that the purpose of MMRE is to assist us to select the best model. In this paper, we have performed a simulation study demonstrating that MMRE does not select the best model. The consequences are dramatic for a vast body of knowledge in software engineering. The implications of this finding are that the results and conclusions on prediction models over the past 15-25 years are unreliable and may have misled the entire software engineering discipline. We therefore strongly recommend not using MMRE to evaluate and compare prediction models. Instead, we recommend using a combination of theoretical justification of the models we propose together with other metrics proposed in this paper. Index terms Mean magnitude of relative error, simulation, regression analysis, prediction models, software engineering.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Simulation of Water Balance Components Using a Distributed Hydrological Model in Taleghan Watershed

Water changes information in the hydrological system, in time and space, as an environmental issue takes heed of managers and decision makers in watershed management and river engineering, which can be addressed by using spatially distributed modeling. In this study simulation of water balance components in Taleghan mountainous watershed is performed using the spatially distributed hydrological...

متن کامل

Studies of Confidence in Software Cost Estimation Research Based on the Criterions MMRE and PRED

Dan Port (UHawaii) Vu Nguyen (USc) Tim Menzies (WVU) [email protected], [email protected], [email protected] Jan 19, 2009 ABSTRACT Confidence in cost estimation is different than model accuracy. It is related to the significance of results based on model accuracy measures such as MMRE and PRED. A lack of confidence places uncertainty in the accuracy for predicted values and the conclusions based on ...

متن کامل

ارزیابی کارایی مدل WetSpa در شبیه‌سازی فرسایش و انتقال رسوب معلق در آبخیز طالقان

Evaluation of hydrologic behaviour and soil erosion as an environmental crisis is important in order to maintain watershed ecological safety at optimum level. The aim of this study is to evaluate the performance of the distributed hydrological WetSpa model in simulating erosion and sediment transport and also sedigraph in Taleghan watershed, Iran. Base digital maps and daily meteorological time...

متن کامل

A New Empirical Model to Increase the Accuracy of Software Cost Estimation (TECHNICAL NOTE)

We can say a software project is successful when it is delivered on time, within the budget and maintaining the required quality. However, nowadays software cost estimation is a critical issue for the advance software industry. As the modern software’s behaves dynamically so estimation of the effort and cost is significantly difficult. Since last 30 years, more than 20 models are already develo...

متن کامل

Modeling and Simulation of Polyhydroxybutyrate Production by Protomonas extorquens in Fed-batch Culture

Modeling and simulation of Polyhydroxybutyrate (PHB) production by Protomonas extorquens in fed-batch culture were conducted in this research. The fed-batch model, developed for this process, employed a kinetic model proposed by other researchers. Several kinetic models were investigated to choose the best model. The criterion for this selection was goodness of fit (δ2). Haldane kinetic model w...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • IEEE Trans. Software Eng.

دوره 29  شماره 

صفحات  -

تاریخ انتشار 2003